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Abstract 

The traditional node percolation map of directed networks is reanalyzed in terms of edges. In 
the percolated phase, edges can mainly organize into five distinct giant connected components, 
interfaces bridging the communication of nodes in the strongly connected component and those 
in the in- and out-components. Formal equations for the relative sizes in number of edges of 
these giant structures are derived for arbitrary joint degree distributions in the presence of local 
and two-point correlations. The uncorrelated null model is fully solved analytically and compared 
against simulations, finding an excellent agreement between the theoretical predictions and the 
edge percolation map of synthetically generated networks with exponential or scale-free in-degree 
distribution and exponential out-degree distribution. Interfaces, and their internal organization 
giving place from "hairy ball" percolation landscapes to bottleneck straits, could bring new light 
to the discussion of how structure is interwoven with functionality, in particular in flow networks. 

PACS numbers: 89.75.Hc, 64.60.Ak 
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I. INTRODUCTION 



The theory of percolation applied to random networks 1| has proven to be one of the 



4|. Its importance goes beyond 



most notorious advances in complex networks science j^, |3|. 
the production in the short term of theoretical results, which are general and relevant to 
systems in many different fields. The implication are far-reaching. On one hand, a number 
of different problems have a direct interpretation in terms of percolation or can be mapped 
to it, such as the study of resilience or vulnerability in front of random failures [5] or SIR 



epidemic spreading models 



ll| . On the other hand, the emergent percolation 



landscape can strongly affect properties such as fluency or navigability in self-organized 
systems. Hence, the conformation of connectivity structures in the percolated phase should 
ensure efficient communication at the global level so that different parts of the system- 
individuals, modules, or substructures- are able to interact for the whole to organize and 
develop functionality. 

In the case of undirected networks, where elements are linked by channels operating in 
both directions, the basic percolation discussion was centered around the appearance of a 
macroscopic portion of connected nodes that are linked through undirected paths and so can 
communicate among them. The critical point for the appearance of this giant component 

nnnnn 

and its relative size in number of nodes and edges w as determined [5|, H2J, H3|, H4J, |15J], also in 



the presence of specific structural attributes 

in directed graphs H Q, q, y , s 
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211 ]. In its turn, the standard 
251 ] establishes that this giant connected 



component may become much more complex and internally organized in three main giant 
structures, the in-component, the out-component, and the strongly connected component, as 
well as other secondary aggregates such as tubes or tendrils. This conformation, sometimes 
represented as a bow-tie diagram 26j, denotes a potential global flow -of matter, energy, 
information...- organized around a core which usually processes input into output. 

In this work, we will see that the percolation landscape, the aggregate of macroscopic 
connectivity structures in the percolated phase above the critical point, is further shaped 
when edges, the 0-level primary building blocks of networks along with nodes, are taken as 
starring elements. Five distinct components are found to be relevant in the edge percola- 
tion map of directed networks, the traditional strongly connected and the in and out node 
components, and two newly identified interfaces bridging the communication between them. 
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In Sec. HI1 we define the relevant components and present analytical computations based on 
the generating function formalism and the usual locally treelike assumption for their rela- 
tive size in number of edges in purely directed random networks that can present local and 
two-point correlations. In Sec. IIII[ the formal equations for the most general situation will 
be reformulated for the prototypical null model of uncorrelated networks. The correspond- 
ing analytical results will be compared to simulations for networks with exponential in and 
out degree distributions and to numerical solutions associated to networks with scale-free 
in-degree and exponential out-degree distributions. A discussion of the implications coming 
out of this description will be provided in Sec. IIVI where the concept of interface will be 
further examined along indications of the potential relevance of its internal structure, that 
could organize to produce from "hairy ball" percolation landscapes to bottleneck straits. 
We end by summarizing and giving some final remarks in Sec. [V] 



II. EDGE COMPONENTS IN DIRECTED NETWORKS 

In the traditional node percolation map of directed networks the core structure is the 
giant strongly connected component (GSCC), where all vertices within can reach each other 
by a directed path. When present, it serves as a connector of the giant in-component (GIN), 
composed by all vertices that can reach the GSCC but cannot be reached from it following 
directed paths, to the giant out-component (GOUT), made of all vertices that are reachable 
from the GSCC but cannot reach it following directed paths. 

From the point of view of edges, the GIN and the GOUT unfold into two structures each, 
the edge in-component (ICE) and the in interface (ITF), and the edge out-component (OCE) 
and the out interface (OTF) respectively, so that five giant components should indeed be 
distinguished. This increase in the number of relevant structures is a consequence of the 
fact that nodes are point objects and they belong to just one of the three node components, 
whereas edges can be considered as extended objects in the sense that they could belong 
simultaneously to two different node components, having for instance one end in the GIN or 
GOUT and the other in the GSCC. This fact points to the necessity of defining new classes 
for edges. We will not take into account aggregates such as tendrils or tubes, so that edges 
will be classified into five different categories depending on the affiliation of the nodes they 
are joining. Let us recall that, in the node percolation map, the out- and in-components of 
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individual vertices are defined as the number of vertices (plus itself), Sj, that are reachable 
from a given vertex and the number of vertices (plus itself), s , that can reach that vertex, 
respectively. The GSCC can be thus thought of as the set of vertices with infinite in- and 
out-components simultaneously, and the GOUT and GIN as the set of vertices with infinite 
in-component and infinite out-component respectively, excluding the GSCC. Taking this 
into consideration, we give the following definitions for the different principal components 
of the edge percolation map of random directed networks: 

• The edge in-component, ICE, is the set of edges joining source and destination nodes 
with finite in-component and infinite out-component. These edges are connecting 
nodes within the GIN. 

• The in-interface, ITF, is the set of edges joining source nodes with finite in- 
component and infinite out-component and destination nodes with infinite in- and 
out-components. These edges are bridging the ICE and the SCE (see below) by con- 
necting nodes in the GIN to nodes in the SCC. 

• The edge strongly connected component, SCE, is the set of edges joining source and 
destination nodes with infinite in- and out-components. These edges are connecting 
nodes within the SCC. 

• The out-interface, OTF, is the set of edges joining source nodes with infinite in- 
and out-components and destination nodes with infinite in-component and finite out- 
component. These edges are bridging the SCE and the OCE by connecting nodes in 
the SCC to nodes in the GOUT. 

• The edge out-component, OCE, is the set of edges joining source and destination nodes 
with infinite in-component and finite out-component. These edges are connecting 
nodes within the GOUT. 

The critical point for the simultaneous appearance of the three giant node components 
-as well as other secondary structures such as tubes or tendrils- trivially marks also the 
emergence of the five giant edge components. In the most general case, the condition A m > 
1 characterizes the percolated phase, where \ m stands for the maximum eigenvalue of a 
characteristic matrix. In the case of purely directed random networks, where the main 
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FIG. 1: (color online). Schematic representation of the main giant components in the edge perco- 
lation map. As illustrated in the sketch, the different components can be heterogeneous in their 
sizes. 



attribute of each node is its degree k = (fcj, k Q ) determined by its incoming and outgoing 
number of connections ki and k Q , the characteristic matrix in the presence of two-point 



correlations was found to be C£ k / (or C kk / with the same results) [231 ]. 

Cfc = k' P (k'\k) 

(1) 

C kk , = *jP,(k'|k), 

where the transition probabilities Pj(k'|k) and P (k'|k) measure the likelihood to reach a 
vertex of degree k' leaving from a vertex of degree k using an incoming and an outgoing 
edge, respectively. If the degrees of connected vertices are statistically uncorrelated, this 
condition reduces to the first-born [ijj] 

Y,ko(h-l)P(k u k o )>0, (2) 

where P(ki, k a ) = -P(k) is the joint degree distribution of in- and out-degrees, that could 
encode local correlations. 



A. Analytical computation of edge components size in purely directed networks 



In order to compute the sizes of the different giant components in number of edges, 
the already traditional approach used in previous developments is also appropriate with 
necessary adjustments. The mathematical methodology is based on the generating function 
formalism while the physical methodology explores the network with branching processes 
which expand under the locally treelike assumption 
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231 ] . Maximally random purely 
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directed networks with local and two-point correlations will be considered. This implies that 
the relevant information about the topology of the network is encoded in the joint degree 
distribution P(k, k'), where k is the degree of the source node and k' the degree of the 
destination node, or, equivalently, in the degree distribution P(k) along with the transition 
probabilities Pj(k'|k) and P D (k'|k). These are related through the following degree detailed 



balance condition 



23,1221 



fc P(k)P (k'|k) = ^(kOPtklk'), (3) 

which is fulfilled whenever any edge leaving a vertex points to another or, in other words, 
whenever the network is closed and does not present dangling edge ends. Although the 
condition is satisfied for the whole graph, the three node components -GIN, GSCC, and 
GOUT- do not fulfill the detailed balance condition separately. If one restricts to consider 
the nodes within the boundaries of each component along all their connections, dangling ends 
can be found. The interfaces are just the sets of edges that prevent the node components 
from fulfilling the detailed balance condition separately. 

Apart from the distributions above, the calculations also rely on the edge joint distribu- 
tion G(si, s Q ; s' { , s' ) associated to directed edges joining source and destination vertices. It 
measures the simultaneous occurrence of finite sizes for the different single node components 
associated to the connected vertices. More specifically, it measures the number of vertices 
(plus itself), s , that are reachable from the source vertex and the number of vertices (plus 
itself), that can reach the source vertex, simultaneously to the number of vertices (plus 
itself), s' Q , that are reachable from the destination vertex and the number of vertices (plus 
itself), s'i, that can reach the destination vertex. Notice that if computations are done for 
node components, the relevant distribution is G(sj, s Q ) and refers to just one node. Accord- 
ing to the definitions above, and as a function of the edge joint distribution, the relative 
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sizes of the different giant edge components can be formally written as 

s t s >. 

9oce = G ( Si = 00 ' S ° ; S 'i = 00 ' 

^/ = X] G ( Sj = 00, s D = 00; s- = 00, s'J 

5f sce = G(sj = 00, s = 00; s- = 00, s' Q = 00), (4) 

where we have made use of the fact that if the destination node has an infinite out-component 
so it has the source node and, analogously, if the in-component of the source node is infinite 
so will be the in-component of the destination node. These functions can be computed from 
the marginal distributions associated to G(si, s Q ; s[, s' ), which preserve just some of the four 
variables. Their dependence on a given variable Sj/ indicates that the corresponding in or 
out-component of the source or destination vertex (destination vertex with prima) is finite 
with size Sj/ G regardless of the size of the rest of the involved single node components. For 
instance, the function G(si, ; s[, ) measures the probability of an edge connecting a source 
node with finite in-component of size Sj to a destination node with finite in-component of 
size sj, regardless of the sizes of the out-components of connected nodes, that could be finite 
or infinite (notice that for ease of notation we just left blank the spaces corresponding to 
the marginalized variables). In terms of these marginal probabilities, the relative sizes of 
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the main components are: 

9ice = X^ G(sj, ; Sj, ) — X G(si, ; s-, s„) 

g oce = X C(, s G ; , s' ) — X G(sj, s G ; , s' ) 

= X G ( Si ' ; ' ) ~ X G ( s *' ; s '" ) ~ 
X ; , s'J + X 

9otf = X G (' ' ' s °) ~ X G(, s d ; , s' ) - 

3 sce = 1 - J] G(si, ; , ) - X G (- - s o) + 

Z G ( s -> s o)- (5) 

These marginal probabilities depend on the degrees of the nodes at the ends of the edge 
under consideration. Edges connecting nodes in the same degree classes will be considered 
statistically equivalent, so that these functions should be rewritten over joint degree classes. 
For instance, 

G(s h ; Si , s' ) = X p ( k - k 'Ms,, ; Si, s' \k, k'), (6) 

k,k' 

and analogously for the rest. To calculate these conditional probabilities we have to in- 
troduce at this point the probability functions g (s\k) and <7j(s|k), which represent the 
distributions of the number of reachable vertices from a vertex, given that we have arrived 
to it from another source vertex of degree k following one of its outgoing or incoming edges, 
respectively. These functions are exactly the same as those already introduced in previous 
works for the computation of the sizes of the GIN, GOUT and GSCC. The marginal con- 
ditional probabilities can then be expressed as functions of these single-node probabilities, 
that in its turn obey closed equations obtained from an iterative procedure which applies 
the techniques of random branching processes under the locally treelike assumption. This 
hypothesis is correct if the length of cycles present in the network is of the order of its diam- 
eter, so that the sizes of single node components can be exposed by subsequent jumps from 
neighbors to neighbors of neighbors without returning to already visited ones (the presence 
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of lower order loops would induce overcounting). In this way, the problem can be formally 
solved in the general correlated case. 

As a way of example, it will suffice here to provide the expression of one of the marginal 
conditional distributions as a function of g (s\\t) and <fe(s|k) to illustrate the derivation. 
Assuming the locally treelike condition, one of the two relevant marginal conditional prob- 
abilities in the computation of the ICE can be written as 

G(si, ; s[, s'Jk, k') = £ 9i(s\\k) • • ■ gi(sl.\k)5 s ^ + ... +s ^ +ljS . 

x ^( s ii k ')---^(^-ii k ')^ + ^ + ... + ^_ i+M 

x £ ^( S ' 1 |k')--^ (^|k')^ 0+ ... +slo+ljS ,. (7) 

This expression for the joint multi-component conditional size distribution G(sf, s-, s' \k, k') 
needs three simultaneous computations: the number of vertices that can reach the source 
node, the number of vertices that can reach the destination node, and the number of nodes 
that the destination node can reach itself. The procedure starts from an edge linking nodes 
of degrees k and k' and splits the sets Sj, s[ and s' Q into the different contributions associated 
to the corresponding neighbors. For instance, the number of edges that bring to the degree- 
k source node, Sj, can be computed as the sum of the different contributions that can 
reach each of its ki incoming neighbors, s\ + ■ ■ ■ + s k . . This corresponds to the first set of 
summations of the three that appear in Eq. ([7]). Independent equations for the functions gi 
and g Q can be found by expanding iteratively this procedure: 

ffi(s|k) = V^k'Ik^Silk') • • • gi(s k Ak')6 s k ,, s 

' • i 

k' 

g (s\k) = £P (k'|k)^( Sl |k / )---(? (^|k / )55 fc ,, s , (8) 
k' 

where S k > = Si H — ■ + sy. + 1 and St' — «i H — • + sy + 1- These equations become tractable 

i i o o 

using the generating function formalism. In mathematical terms, generating functions are 
obtained by applying the transformation f(z) = J2 s f(. s ) zS > so ^ na t functions are brought 
to the discrete Laplace space. Once transformed for the variables s, Eqs. (IE]) become closed 
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for gi and g , 

k' 

g (z\k) = ^P (k'|k)^|k') fe °. (9) 

k' 

All summations over finite sizes of the joint conditional size distributions correspond to their 
generating functions evaluated at z — 1. Eventually, those depend on ^(l|k) and ^(l|k): 



6(1,; 1,1) 


= ^P(k,k')^(l|k) fe ^(l|k')^- 1 ^(l|k') fc ° 

k,k' 


6(1,1;, 1) 


= ^P(k,k')^(l|k)^ (l|k) fc -^ (l|k') fe ° 
k,k' 


G(,l;,l) 


= ^P(k,k')^(l|k) fe -^ (l|k') fc ° 
k,k' 


G(l,;l,) 


= ^P(k,k')^(l|k)^(l|k') fc - 1 
k,k' 


G(l,;,l) 


= ^P(k,k')^(l|k)^ (l|k') fe ° 
k,k' 


G(,;,l) 


= ^P(k,k')^(l|k') fe ° 
k,k' 


G(l,;,) 





k,k' 



These expressions will allow us to compute easily the relative sizes of the different compo- 
nents: 

g ice = (7(1, ;1,)- (7(1,; 1,1) 

g oce = G(,l;,l)-G(l,l;,l) 

g uf = G(l,;,)-G(l,;l,)-G(l,;,l) + G(l,;l,l) 

g otf = G(,;,l)-G(,l;,l)-G(l,;,l) + G(l,l;,l) 

g sce = 1-G(1,;,)-G(,;,1) + G(1,;,1). (11) 

Notice that the sizes of the interfaces can also be written as 

9itf = G(l,;,)-G(l,;,l)-g ice 
g otf = G(, ; , 1) - 6(1, ; , 1) - g OC e- 

(12) 
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The set of Eqs. (l9])- (flQ]) - (ITT]) determines completely the relative sizes in number of edges 
of the main giant components of the edge percolation map of two-point correlated purely 
directed networks. 



III. UNCORRELATED PURELY DIRECTED NETWORKS 

The formal solution given in the previous section becomes simpler for the classical null 
model of uncorrelated networks. This will allow us to perform further analytical computa- 
tions that will be checked against simulation results in order to contrast the accuracy of the 
theory. 

The absence of two-point correlations make possible to factorize the joint degree distri- 
bution, and the conditional degree distributions also simplify: 

p(kk , ) = ww) (13) 



and 



P„(k'|k) = , P i( k'|k) = (14) 

In this situation, Eqs. ([9]) evaluated in z — 1 reduce to 

$ (l|k) EE Ul) = Y, k -^Ul) K , (15) 

so that the relative sizes in number of edges of the different components in the uncorrelated 
case just depend on the joint degree distribution P(k) and the single-node in and out 
generating functions g«(l) and <7o(l), and can be written as 

= E^^(i) fe Hi-<7 (i) feo ) 

k ^ 

= E^f^^(l) fe °(l-Ml) fci ) 
9itf = - 9oiX)) - 9ice 

g sce = (1 -&(!))(! -&(1)). (16) 
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If local correlations are also absent, the expressions above become even simpler 

ki 



9ice = (1 -9oQ))^2 



9o 



;i-mi))E 



(h) 

k g {l) k °P{k ) 



(ki) 



with 



9itf = (1 -&,(1))&(1) - 9ice 
9otf = (! )&>(!) - 9oce 

g sce = (1-&(1))(1-&(1)) 

g i (l)=J2P(h)9i(l) ki 

k'. t 

g (l) = Y,P(ko)g (l) k °. 



(17) 



(18) 



A. Comparing against simulations 

In order to ascertain the accuracy of the theory, we contrast the analytical results with 
those obtained from simulating uncorrelated purely directed networks with given joint degree 
distribution of the form P(k) = P(ki)P(k ). Uncorrelated networks are generated according 
to a slightly modified version of the Molloy-Reed prescription [jjl, Q -which is based on 



the configuration model 



28 



29j and constructs maximally random networks with a given 



degree sequence- to produce directed connections controlling that £V ki = Y^ a ^° an< ^ a ^ so 
taking care of avoiding multiple or self-connections. 

1. Exponential in- and out-degree distributions 

For the first case study, we chose P(ki) and P(k ) of the form 



Pn 



P(k) = < 



a - p o) 2 
(k) 



i - 



(k) 



k-l 



k = 



k > 1 



(19) 



so that a full analytical solution is available. The sizes of the giant components in the edge 
percolation map for this particular joint degree distribution just depend on the parameters 



12 




FIG. 2: (color online). Relative sizes of the main giant components in the edge percolation map 
of networks with exponential in- and out-degree distributions as a function of the average degree. 
Simulation results (dots) correspond to 1-realization measures on synthetic networks with N = 10 5 
vertices, Pq = 0.4, and Pq = 0.8. Solid lines are the analytical solutions Eqs. (f20"j) - ([2"T|) . 



Pq and Pq and the average degree (k^) = (k Q ). Substituting Eqs. ([HI into Eqs. f[TB"j) . the 
solutions are found to be 



and the relative sizes 



pi JDO 1 p 

q l q° (h) 



9ice 
9oce 

9itf 
9otf 

9 'see 



&(!)(! -&(!)) 

(k) 2 

9o (l)(l - g t (l)) 

(ki) 2 
gi(l)(l - g a (l)) 

(ki) 2 

g (l)(l-gm (n , 

(ki? 



((h) 2 -i) 
((h) 2 -i) 



(1 -&(!))(! -So(l)). 



(20) 



(21) 



We compared these results with direct measures of the edge components on a synthetic set 
of purely directed random networks with iV = 10 5 . We fix the values Pq = 0.4, Pq = 0.8, and 
vary the average degree from (ki) = 1 to (ki) = 10. As Fig. [2] shows, the conformity of our 
formulas to the simulation results is excellent. Notice also that for this particular choice of 
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FIG. 3: (color online). Relative sizes of the giant components in the edge percolation map of 
uncorrelated networks with scale-free in-degree distribution, 7 = 2.2, and exponential out-degree 
distribution, Pg ut = 0.4, as a function of the average degree. Simulation results (dots) correspond 
to synthetic networks of size N = 5 • 10 5 vertices, 3 realizations for the first two point and one 
realization for the rest. Solid lines correspond to the numerical solutions of Eqs. p!7|) - (fT8]) . 

the parameters Pq and Pfi, the out-interface, OCE, is by far the biggest edge component in 
the percolated phase for all values of the average degree above approximately 1.5 , followed 
with a noticeable difference by the edge strongly connected component, SCE, and the in 
interface, ICE. In this example, the interfaces are much stronger than the edge in- and out- 
components, practically absent for high degrees. This edge percolation map is seen to be 
quite stable for most of the average degree range (see Sec. [IV] for further discussion). 



2. Scale-free in-degree and exponential out-degree distributions 



In some real networks, such as the WWW [30|] for instance, the in-degree distribution 
exhibits a heavy-tailed form well approximated by a power-law behavior P(ki) ~ A;" 7 , at 
the same time that a different functional dependence is faced in the case of the out-degree 
distribution P(k a ), which can present clear exponential cut-offs. In biology, transcriptional 
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regulatory networks are characterized by the reflexive situation, in which an incoming degree 
distribution that decays faster than a power law can be observed along with a scale-free 
outgoing degree distribution [3l| . It is then particularly interesting to see what happens for 
power law in- or out-degree distributions when combined to exponential out- or in-degree 
ones. In this example, the in-degree distribution is taken to follow a scale-free form of the 
type 

P k = 

(22) 



Pi(k) 



k>l 



where £(7) is the Zeta Riemann function. The out-degree distribution is given again by 
Eq. (fT9|) . The set of Eqs. (ITS]) is solved numerically and plugged into Eqs. OTj) to get the 
relative sizes of the edge components and the results are compared to direct measures of the 
edge percolation map on a set of synthetically generated networks with N = 5 ■ 10 5 nodes. 
We take 7 = 2.2, Pq = 0.4 and vary the average degree (hi) = (k Q ) until the maximum 
possible value is reached by adjusting Pq. This upper boundary in the average degree is 
due to the fact that, since Pq = 1 — (&i)C(7)/C(7 — 1)> values above the threshold impose a 
negative Pq and are not realizable. The theoretical value for this threshold is £(7 — l)/£(7). 

Once again, our predictions compare extremely well with the measures on the simulated 
networks, see Fig. [3j Interestingly, and in contrast to what was obtained in the previous 
example, the edge percolation map changes dramatically depending on the average degree. 
For small values -but big enough to ensure that the system is in the percolated phase, 
(ki) > 1— , the edge in-component, ICE, and the in-interface, ITF, are predominant. However, 
the rest of edge components grow steadily with the average degree while those reach a 
maximum and then decay to eventually disappear at the average degree threshold, so that 
for high values of the average degree the edge strongly connected component, SCE, and the 
out-interface, OTF, dominate. 



IV. INTERFACES 



Interfaces arise as distinctive elements of the edge percolation map. From the analytical 
computations one sees that the interfaces are also giant components. Furthermore, their size 
could be much larger than that of the ICE and OCE, for instance as shown in Figs. [2] and 
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<k> 

FIG. 4: (color online). Ratio of the relative sizes of the giant interfaces to the edge in- and 
out-components as as a function of the average degree. Simulation results (dots) correspond to 
1-realization networks of size N = 10 5 vertices with Pq = 0.4 and Pfi = 0.8. Solid line correspond 
to the analytical ratio Eq. (|23p . 

|3l In the particular case of the completely uncorrelated networks with exponential in- and 
out-degree distributions given by Eq. (jT9l) . the relative sizes of the interfaces as compared to 
that of the pure components can be calculated analytically and found to be 

9*1 = 9°tf_ = {h) 2 _ L (23) 

dice Qoce 

The same relation is numerically seen to happen for the ratio between the out-interface and 
the edge out-component of the second case study where the in-degree distribution was scale- 
free and the out-degree distribution exponential. So, for exponential distributions the ratio 
of the relative sizes of the giant interfaces to the corresponding in- or out-component grows 
quadratically with the average degree. This result is very interesting because it suggests that, 
at least in this case, the traditional GIN and GOUT components of the node percolation 
map show a shallow architecture mainly formed by leaf edges emanating from or pointing 
to the SCC. As a consequence, and to give a mental image, the bow-tie structure of those 
networks rather becomes a "hairy ball" . 

All this points to a rich second order fine structure that could play a central role in the 
investigation of how topology is related to functionality. In particular, and apart from the 
information contained in both the node and edge percolation maps, the internal structure 
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of the interfaces, and more specifically the distinction of leaf edges from connectors, is 
fundamental in order to assess the efficiency of the global flow or the risks of bottleneck 
effects. Further discussion about the internal structure of interfaces and possible implications 
will be provided in a forthcoming work. 



A. Internal average degrees 

Interfaces have a hybrid nature from the point of view of node components. In order to 
calculate internal average degrees, it is not clear whether they should be assigned to one 
node component or another. If one considers for instance the subset of nodes in the SCC 
with all their connections, internal or not, it is found that 

^SCC ki YlsCC ko / o/i \ 
-jjj - 9sce + gitf J £ - 9sce + 9otf ■> 1/4 J 

where E is the total number of edges in the network. As a consequence, the detailed balance 
condition Eq. ([3} will not be accomplished in general, Ylscc K ^ J2scc exce Pt when 
both interfaces are of equal sizes. The same happens for the subsets of nodes in the GIN 
and the GOUT, where from the point of view of detailed balance there is an excess out- and 
in-degree respectively. The interfaces are precisely the responsible for these imbalances. 

We explore once more as a null model that can be fully calculated analytically the com- 
pletely uncorrelated network, with no local or degree-degree correlations. In this situation, 



the sizes of t 
Refs. [l6, 19, 



re main components in the node percolation map can be expressed as (see 



23|) 



g scc = 1 -<7 (1) -<7i(l) +g {l)g~i{l) 
g in = 1 - <? (1) - g scc 

g ou t = 1 - <?i(l) - g S cc, (25) 

where <?;(1) and g (l) are the solutions of Eq. ffl8l) . like for the edge percolation map. 
Comparing Eq. (125^) and Eq. (1171) . it is found that the relative sizes in number of nodes of the 
GIN, GOUT, and SCC are the same as the relative sizes in number of edges of the ICE+ITF, 
OCE+OTF, and SCE respectively. In other words, the average degree of the whole network 
is preserved in the different components if the in- and out-interfaces are assigned to the in- 
and out-components respectively. This is in particular valid for the previous examples of 
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uncorrelated networks with exponential or scale-free in-degree distribution and exponential 
out-degree distribution. 

V. CONCLUSIONS 

We focus on edges instead of nodes to investigate analytically how they organize in the 
percolated phase of purely directed random networks. Interfaces of edges are found to 
bridge the main components of the node percolation map. The general case of local and 
degree-degree correlations is formally solved and the relative sizes of the five main giant 
edge components are characterized quantitatively. The results for uncorrelated networks 
are found to be in very good agreement with direct measures on synthetic networks, that 
could present very different edge percolation maps depending on the in- and out-degree 
distributions and the average degree. 

The node percolation map is in this way complemented by the edge percolation map, 
forming a percolation landscape that gives a more detailed topological description of the 
structure of globally connected systems. The work should not stop here, since results in this 
paper seem to point out to the importance of the internal organization of the interfaces with 
latent implications at the level of functional properties. So, the analysis presented in this 
work uncovers a new aspect potentially relevant not only for the structure of directed net- 
works but most importantly for their functionality. Generally, the SCC processes input into 
output so that interfaces become unavoidable bridges that could determine the effectiveness 
or the robustness of functional performance. 

In this work, we have restricted to purely directed networks, a good approximation in 
many cases where flow or transport, when present in both directions, is asymmetric. Nev- 
ertheless, the same ideas can be extended to semi-directed networks, the most general and 
realistic ones. For those, analytical calculations could be a bit more intricate due to the 
non-trivial correlations associated to reciprocity. 
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